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Abstract. We consider the satisfiability problem for the two-variable 
fragment of first-order logic over finite unranked trees. We work with 
signatures consisting of some unary predicates and the binary naviga- 

i^ ■ tional predicates 4- (child), — >• (right sibling), and their respective tran- 

sitive closures 4+, — > + - We prove that the satisfiability problem for the 
q, logic containing all these predicates, FO 2 [4-, 4-+ > — ^5 — ^ + ] j is ExpSpace- 

^A • complete. Further, we consider the restriction of the class of structures 

to singular trees, i.e., we assume that at every node precisely one unary 

\& ■ predicate holds. We observe that FO 2 [4-, i+, — ►, — > + ] and even FO 2 [4+ , 4] 

C^ ' remain ExpSPACE-complete over finite singular trees, but the complexity 

decreases for some weaker logics. Namely, the logic with one binary pred- 
icate, 4+, denoted F0 2 [4+], is NExpTlME-complete, and its guarded ver- 
sion, GF 2 [4+], is PSPACE-complete over finite singular trees, even though 

I— ^ , both these logics are ExpSPACE-complete over arbitrary finite trees. 

CO 
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1 Introduction 

f^) ' Classical results from the 1930s by Church and Turing show that the satisfia- 

OQ , bility problem for first-order logic is undecidable. Moreover, undecidability can 

be proved even for the fragment with only three variables, FO 3 , [12] . This fact 
attracted the attention of researchers to the two- variable fragment, FO , which 
turns out to be decidable [T5] and NExpTlME-complete [8]. In particular, FO 2 
gained a lot of interest from computer scientists, because of its close connections 
to formalisms such as modal, temporal, description logics, and XML, widely used 
in various areas of computer science, including hardware and software verifica- 
k> ' tion, knowledge representation, databases, and artificial intelligence. 

j^ , The expressive power of FO is limited and is not sufficient to axiomatisc 

some natural simple classes of structures, such us trees or words. It is also not 
possible to say, e.g., that a binary relation is transitive, an equivalence or a linear 
order. Thus, FO over various classes of structures, in which certain relational 
symbols have to be interpreted in a special way, e.g., as equivalences, has been 
extensively studied (see, e.g., [9110120114115116] for some results in this area). 
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FO over words is investigated in [7]. The authors work there with signa- 
tures consisting of some unary predicates and two built-in binary predicates: 
succ for the successor relation and < for its transitive closure. The resulting 
logic, FO [succ,<], is shown to have NExpTlME-complete satisfiability prob- 
lem, both over w-words and over finite words. Actually, the lower bound can 
be shown for monadic FO , i.e., without using the binary relations succ and 
<. The elementary complexity of FO over words sharply contrasts with the 
non-elementary complexity of FO 3 over words which follows from [53] . 

In this paper we consider FO over unranked trees (ordered or unordered), 
assuming that, beside unary symbols, signatures may include the child relation 
4., the right sibling relation — ►, and their respective transitive closures 4-+ and 
— > + . Decidability of the satisfiability problem for FO 2 over various classes of 
infinite trees is implied by the celebrated result by Rabin [2j~j , that the monadic 
second-order theory of the binary tree is decidable. Over finite trees decidability 
follows from |llj . However, regarding complexity, the above mentioned results 
give only non-elementary upper bounds. A better upper complexity bound for 
the richest of the logics we consider, F0 2 [4, !_,., — ►, — > + ], can be obtained by 
exploring its correspondence to XPath. In [18] it is argued that F0 2 [|, 4-+, — >, -^ + ] 
is expressively equivalent to a variant of Core XPath which is shown in [17] to be 
ExpTiME-complete. As the translation to XPath involves an exponential blowup 
in the size of formulas, we get this way 2-ExpTime upper bound. Our first 
contribution is establishing the precise complexity of the satisfiability problem 
for F0 2 [4,, 4-+, — h -^ + ] over finite trees by showing that it is ExpSPACE-complete. 

Worth mentioning here is the work from [2], where two- variable logics over 
unranked, ordered trees with additional equivalence relation on nodes, denoted 
~, is proposed. The purpose of ~ is to model XML data values. It is argued that 
this extension of F0 2 [4, 4+, — >, — > + ] is very hard and its decidability is left as an 
open problem. On the positive side, decidability of F0 2 [4, — >, ~] is shown. 

In the context of XML reasoning it is natural to consider also the addi- 
tional semantic restriction that at a node of a tree precisely one unary predicate 
holds. We call trees meeting this assumption singular trees. In [24] an anal- 
ogous restriction for finite words is considered!^ It appears that FO 2 [succ, <] 
over finite singular words remains NExpTlME-complete, but F0 2 [<] becomes 
NPTiME-complete. In this paper we observe a similar effect in the case of un- 
ordered trees: over singular trees, F0 2 [4-+,|] remains ExpSPACE-hard, and the 
complexity of F0 2 [4_+] decreases. This time the complexity drop is slightly less 
spectacular, as the problem is NExpTlME-complete. We observe, however, that 
for NExpTiME-hardncss the ability of speaking about pairs of elements x, y in 
free position, i.e., such that y is neither an ascendant or descendant of x, is 
needed. This is not typical of logics used in computer science, as their atomic 
constructions usually allow to refer only to pairs of elements that lie on the same 
path. To capture the former kind of scenario we consider the restriction of FO 2 



3 In that paper a slightly different terminology is used: the term word denotes a 
structure meeting the singularity assumption, and the term power words is reserved 
for structures that allow for multiple unary predicates holding at a single position. 



to the two-variable guarded fragment, GF , in which all quantifiers have to be 
relativised by binary predicates. We observe that the satisfiability problem for 
GF 2 [J, + ] over finite singular trees is PSPACE-complete. To complete the picture 
we show that augmenting GF [4_ + ] with any of the remaining navigational pred- 
icates leads to ExpSPACE-hardness over singular trees. Thus, we establish the 
complexity over finite trees and over finite singular trees of all logics GF [run} 
and F0 2 [r bm ], for {| + } C T bin C {|,| + ,^,^ + }. 



2 Preliminaries 

Trees and logics. We work with signatures of the form r = tq U Tun, where 
To is a set of unary symbols and Tun ^= {-l, -1+, —*, ~^ + }- Over such signatures 
we consider two fragments of first-order logic: FO 2 , i.e., the restriction of first- 
order logic in which only variables x and y are available, and GF being the 
intersection of FO and the guarded fragment, GF [I] . GF is defined as the least 
set of formulas such that: (i) every atomic formula belongs to GF; (ii) GF is 
closed under logical connectives ->, V, A, =>; and (iii) quantifiers are appropriately 
relativised by atoms, i.e., if </?(x, y) is a formula of GF and a(x, y) is an atomic 
formula containing all the free variables of tp, then the formulas Vy(a(x, y) =>■ 
tp(x., y)) and 3y(a(x, y) Atp(x., y)) belong to GF. Atom a(x, y) is called a guard. 
Equalities x—x or x=y are also allowed as guards. 

For a given formula tp we denote by tq(p) the set of unary symbols that 
appear in p. We write F0 2 [t(,j„] or GF 2 [T(,i„] to denote that the only binary 
symbols that are allowed in signatures are those from Tbin- We are interested in 
finite unranked tree structures, in which the interpretation of symbols from Tbin 
is fixed: if available in the signature, 4- is interpreted as the child relation, — > as 
the right sibling relation, and 4+ and — > + as their respective transitive closures. 
If at least one of — >, — > + is interpreted in a tree then we say that this tree is 
ordered] in the opposite case we say that the tree is unordered. 

We use x^y to abbreviate the formula stating that x and y are in free po- 
sition, i.e., that they are related by none of the binary predicates available in 
the signature. E.g., if we consider ordered trees over Tbin — {I, i+, — >•, — > + } then 
X'fty can be defined as x^y A -i(x\. + y) A -i(y! + x) A ->(x— > + y) A -*(y—t + x); for 
unordered trees over Tun = {1+} it is just xj^y A -i(x4-+y) A ~^(y-l + x). 

Let us call the formulas specifying the relative position of a pair of elements 
in a tree with respect to binary predicates order formulas. There are ten possible 
order formulas: x\y, yix, x-l + y A -^(x^y), yi+x A -i(y],x), x^-y, y—yx, x— > + y A 
-i(x—yy), y^f + x A~i(y— >x), x^y, x=y. They are denoted, respectively, as: 9±, 6^, 
0-l4+> ^tt + ' 0-*i ®4—> ^=s+' ^5=+i ^> ^=- Let be the set of these ten formulas. 

A structure over a signature t = tq U Tbin is singular if at every element of 
this structure precisely one unary predicate from To holds. We say that a formula 
tp is singularly satisfiable (over a class of structures C) if there exists a singular 
model of tp (from C). 

We use symbol T (possibly with sub- or superscripts) to denote tree struc- 
tures. For a given tree T we denote by T its universe. A tree frame is a tree 



over a signature containing no unary predicates. We say that a formula ip is 
(^singularly) satisfiable over a tree frame T if *J |= p for some (singular) 1 such 
that T is the restriction of 1 to binary symbols. 

Normal form. We say that an FO [4-, i-+,—y, — > + ] formula <p is in normal form 
if <p = Vxyx(x,y) A A l£ / Vx(Aj(x) ^> 3y(r]i(x,y) A ipi(x,y))), for some index 
set /, where x( x ;2/) is quantifier-free, Aj(x) is an atomic formula a(x) for some 
unary symbol a, ipi(x,y) is a boolean combination of unary atomic formulas, 
and r]i(x, y) is an order formula. Please note, that in x the equality symbol may 
be used, e.g., we can enforce that a model contains at most one node satisfying 
a: Vxy(a(x) A a(y) =>• x=y). The following lemma can be proved in a standard 
fashion (cf. e.g., [To]). 

Lemma 1. Let ip be an FO [4-, 4-+ 7 — >, — > + ] formula over a signature r and let 
T be a tree frame. There exists a polynomially computable FO [4-, 4- + , — >, — > + ] 
normal form formula p' over signature t' consisting of r and some additional 
unary symbols, such that tp is satisfiable over T (singularly satisfiable over T) 
iff p> is satisfiable over T (satisfiable over T in a model that restricted to r is 
singular). 

Consider a conjunct ipi = Vx(Aj(x) => 3y(i]i(x,y) A ipi(x,y))) of a normal 
form F0 2 [4-, 4-+, — », — > + ] formula ip. Let % \= p>, and let v G T be an element such 
that T |= Xi[v]. Then an element w G T such that T \= r}i[v,w] Aipi[v, w] is called 
a witness for v and <pn. Sometimes, b is called an upper witness if r/i(x, y) \= y]. + x, 
a lower witness if T)i(x, y) \= x4- + y, and a free witness if rji{x, y) (= x^y. 

Types. A (atomic) 1-type, over a signature r = To U Tbin, is a subset of To- We 
often identify a 1-type a with the formula f\ aGa a(x) A A a £a ~~*o,(x). F° r a given 
r-tree T, and v G T, we denote by tp 2 ^) the 1-type realized by u, i.e., the 
unique 1-type a such that T |= a[v]. 

A /m?/ tj/pe is a function a : — y V(t ), such that «(#-[-), a (#_,.), a{9^) 
are singletons or empty, a{9 = ) is a singleton, and if a(6f) (respectively a(9±), 
a(9<-), a(9->)) is empty then a(9^+) (respectively a($^ + ), a(9^+), a(9^+)) is 
also empty. We employ the following convention: for a given full type a we denote 
by a the unique member of a(6 = ). For a given r-tree 1, and v G T, we denote 
by ftp (i>) the full type realized by u, i.e., the unique full type a, such that a is 
the 1-type of v, and for all 9 G 6* we have that a(0) = {tp^iu) : T (= 0[u, w]}. 

A reduced full type is a tuple (a, A, i?, F), where a is a 1-type and A,B,F 
are sets of 1-types. Reduced full types are used to keep information recorded in 
full types in a slightly (lossy) compressed form. Let ftp' r (w) = a. By rftp' r (w) we 
denote the reduced full type realized by v, i.e., the reduced full type (a, A, B, F), 
such that A = a(9 t ) U a(%+), B = a(0;) U a(9 u+ ) and F = a(0_>.) U a(0.<_) U 
a(9^+) U a(0j-+) U a{9^). Note that a denotes the 1-type of v, and, informally 
speaking, A is the set of 1-types of elements realized above v, B is the set of 
1-types of elements realized below v, and F is the set of 1-types of the siblings 
of v and the elements realized in free position to v. 



Note that the number of 1-types is bounded exponentially, and the numbers 
of full types and reduced full types are bounded doubly exponentially in the size 
of the signature. 

For a given normal form FO [4-, ■[■+, — >, — >- + ] formula ip and a full type a, 
we say that a is if- consistent if an element realizing a cannot be a member of 
a pair violating the universal conjunct \/xyx(x,y) of ip, and has all witnesses 
required by ip. Formally, a is (^-consistent if for every 9 G &, and every a' G 
a(9) we have a(x) A a'(y) A 9{x,y) \= x( x iV) A x{v> x )> an d for every conjunct 
Vx(\i(x) =>• 3y(i]i(x,y) A tpi(x,y))) of <p, such that a(x) \= \i(x), there exists 
a 1-type a' G a(rji) such that a(x),a'(y) \= ipi(x,y). A proof of the following 
proposition is straightforward. 

Proposition 1. Let T be a tree and let ip be a normal form, FO [J., 4-+, — ^ ^- + ]- 
formula. Then T |= ip iff every full type realized in T is ip- consistent. 

We say that a full type a is combined of two full types a± and a.2 if a = a± = 
a2 and for each 9 € we have a(6*) = ai(6 l ) or a(6 l ) = a.2{9). Also the following 
fact is immediate. 

Proposition 2. Let ip be a normal form FO [J,, 4_ + , — >, — > + ]-formula, and let 
a be a full type combined of two (p-consistent full types c\\^ai- Then a is tp- 
consistent. 

3 Finite ordered trees 

This section is devoted to a proof of the following theorem. 

Theorem 1. The satisfiability problem for FO [^,4_ + , — >, — > + ] over finite trees 
is ExpSp ace- complete. 

The crucial fact is that every satisfiable formula has a model of exponentially 
bounded depth and degree. We prove this in two steps, and present a procedure 
looking for such small models, working in alternating exponential time. 

Short paths. First, let us see how the paths of a model can be shortened. 

Lemma 2. Let ip be a normal form FO [J.,4. + , — >, — > + ] formula, X its model, 
and v,w G T two nodes of T, such that % (= v-l+w and rftp i (w) = rftp i (w). 
Then the tree %' , obtained from 1 by replacing the subtree rooted at v by the 
subtree rooted at w, is a model of ip. 

Proof. It can be verified that for every u G T", if u^=w then ftp (u) = ftp i (u), 
and that ftp A (w) is combined of ftp 1 ^) and ftp (w). Thus, by Propositions Q] 
and [21 all types realized in 1' are (^-consistent, and 1' \= <p by Proposition [TJ □ 

Using the above lemma we can successively shorten J,-paths in a model of a 
normal form formula ip obtaining after a finite number of steps a model of ip in 
which on every path only distinct reduced full types are realized. Even though 
there are potentially doubly exponentially many reduced full types it can be 
shown that such a model has exponentially bounded 4+-paths. 



Lemma 3. Let ip be a normal form FO [4-, 4- + , — >, — > + ] formula satisfied in a 
finite tree. Then there exists a tree model of (p whose every \,-path has length 
bounded by 3 • (2 2 '' T ° ( - ip '') ! exponentially in \<p\. 

Proof. Let T |= ip be a tree in which on every J,-path only distinct full types are 
realized and let vi, V2, ■ ■ ■ ,Vk be a 4--path in X. Observe that the sets A, B, F in 
reduced full types of Vi behave monotonically. More precisely, if (oii,Ai,Bi,Fi) 
is the reduced full type realized by m, for 1 < i < k, then for i < j we have 
Ai C Aj, Bi D Bj and Fi C Fj. Thus along the path each of the sets A,B,F 
is modified at most 2l r °^^ times (since this is the number of possible 1-types). 
The number of reduced full types with fixed A, B, F is equal to the number of 
1-types, so the length of each path is bounded as required. □ 

Small degree. Now we observe that to provide all witnesses for V3 conjuncts 
of (p we only need nodes with at most exponential degree. 

Lemma 4. Let ip be a normal form FO [4-, 4-+, ~~ >, ~^ + ] formula and let % f= (p. 
Then there exists a model %' \= ip in which the number of successors of each 
node is bounded by 4 • 2 2 '' T °^'' . Moreover %' can be obtained by removing from 
X some number of elements (together with the subtrees rooted at them). 

Proof. We show first how to decrease the degree of a single node of T. Let v be 
a node of 1 of full type a v , and let U be the set of the children of v. For every 
element u G U let a u be its full type. We are going to mark some important 
elements of U and then remove all subtrees rooted at unmarked ones producing 
a model %'" \= ip. First, for every 1-type a, if a is realized in U precisely once 
then mark this realisation; if a is realized more than once then mark the minimal 
and the maximal (with respect to -> + ) realisations of a. Further, for every 1- 
type a, let U a — {u € U | a € a u (6±) U a«(#|4 + ). For each a mark min(2, \U a \) 
elements of U a . Note that so far we have marked at most 4 • |2 r °^)| elements 
of U. Assume that these (listed according to — > + ) are: u%, . . . , Uk- We call them 
primarily marked elements, and denote their set by Up. 

Consider the tree T" obtained from % by removing the subtrees rooted at 
elements of U \ Up. It can be verified that elements from T" \ Up retain in 1" 
their full types from T. Unfortunately, the — ^--connections among the elements 
of Up in %" may be inconsistent with ip. To fix this problem we mark some 
additional elements of U (at most exponentially many) between m and Uj+1, 
for all zQ For every i, consider the — >-chain C of elements of T between m and 
u i+ i. If C is empty then w i+ i is — ^-successor of Ui and there is nothing to do. 
Otherwise, let a be the 1-type of the successor w of Ui. Find the maximal (with 
respect to — ►+) element w' of type a in C, and mark it. The elements between Ui 
and w' will never be marked, so w' will become the — ^-successor of Ui in the final 
model %'" . Thus, m will retain in its full type its a Ui (9^) (singleton) set, and, 
due to our strategy of primarily marking maximal realisations of 1-types, also its 



4 Actually, this fragment of the construction combined with some earlier parts, repro- 
duces the small model theorem for FO 2 over words. 



a Ui (0=}+) set. This is not necessarily true for w' and its (singleton) a(#<_) set, 
and a w i{9^+) set. However, these sets will be equal, respectively, to a w {9^-), 
and a w {9 i= +) sets of w, which means that the full type of w' in %'" will be 
combined of two full types (of w and w') from X. We proceed recursively with 
the — Hchain of elements between w' and Uj+i. 

Note that the number of elements between m and Wi+i which are marked 
during this process is bounded by the number of 1-types. Thus we mark in total 
at most 4 ■ 2^°^^ • 2^°^^ elements of U, as required in the statement of this 
lemma. Let us denote the set of the marked elements Um- We construct X'" by 
removing from X all subtrees rooted at elements of U \ Um- It can be verified 
that all elements from T'" \ Um retain their full types from X, and that the full 
types of elements from Um in X'" are either retained from X or are combined 
of pairs of full types in X of elements from U. By Proposition [T] we have that 
X'" h <P- 

The desired model X' can be obtained by applying the described procedure 
in depth- first manner. □ 

Alternating procedure and complexity. We are ready to design a procedure 
checking if a given F0 2 [J,, -J.+ , — >, — > + ] formula ip has a finite tree model. By 
Lemma [T] we may assume that ip is in normal form. By Lemma [3] and Lemma [4] 
we may restrict our attention to models in which the length of each path and the 
degree of each node are bounded exponentially in \tp\. We present an alternating 
procedure working in exponential time. This justifies that the problem is in 
ExpSpace since, by [5], ExpSpace=AExpTime. The procedure first guesses 
the full type of the root and then guesses the full types of its children, checking 
if the information recorded in the full types is locally consistent, and if each full 
type is (^-consistent. Further, it works in a loop, universally choosing one of the 
types of the children and proceeding similarly. 

Procedure FD 2 [|, | + , -)•, ->+] -sat-test 

input: an F0 2 [|, \. + , —¥, — > + ] normal form formula ip 

— let maxdepth := 3 • |2 T °^)| 2 ; let maxdegree := 4 • 2 2 ^ To(v ^; 

— let level := 0; 

— guess a full type a such that a{9^) = a(9^+) — a{9^) = a(9^+) = 

6(0<_) = 6(0*=+) = a(0/) = 0; 

— while level < maxdepth do 

— if a is not ^-consistent then reject 

— if a(9\) U a(#|4 + ) = then accept 

— guess an integer 1 < k < maxdegree; 

— for 1 < i < k guess a full type type an; 

if not locally-consistent(a, on, ... , oik) then reject; 

— level := level + 1; 

— universally choose 1 < i < k; let a — on; 

— endwhile 
reject 



The function locally-consistent checks whether, from a local point of view, a 
tree may have a node of full type a whose children, listed from left to right, have 
full types oil, ... , otk- Namely, it returns true if and only if all of the following 
conditions hold: 

Horizontal conditions: 

(hi) a<(0«_) = {«,_!} for i > 1; fii(0<_) = 0; 

(h2) Oi(fl_) - {a z+1 } for i < k; a k (0^) = 0; 

(h3) a,(0^+) = a i .i(^)Ua i -i(J t t) for i > 1; &i(0^+) = 0; 

(h4) ai{e^+) = Oi+i(0_>) U a i+ i(0^+) for » < fc; a fe (0^+) = 0; 

Vertical conditions: 

(vl) a(0+) = {ai,...,a fc }; 

(v2) ai(6> t ) = {a} for 1 < i < k; 

(v3) a(6 u+ ) = Ui<i< fc (o*(»4.) u «*(%+)); 
(v4) aj(^ tt +) = a(0tJu a(0 t1 -+) for 1 < i < k; 

Free conditions: 

(fl) oi(^) = U J y i (%(^)Ua,(0u + ))Ua(0^ + )Ua(0^)Ua(0^)Ua(0^ + )UQ(0 / ) 

for 1 < i < fc. 

Lemma 5. Procedure FO 2 [J,, J, + , — s>, — s> + ] -sat-test accepts its input tp if and 
only if tp is satisfied in a finite tree. 

A matching ExpSpace- lower bound follows from [T3], where it was shown 
that a restricted variant of the two-variable guarded fragment with some unary 
predicates and a single binary predicate that is interpreted as a transitive relation 
is ExpSPACE-hard. It is not hard to see that the proof presented there works fine 
(actually, it is even more natural) if we restrict the class of admissible structures 
to (finite) trees. Thus we get the following corollary. 

Corollary 1. Over finite trees the satisfiability problem for each logic between 
GF 2 [J, + ] and FO 2 [|, !+,->■, ->•+] is ExpSPACE-comp/eie. 



4 Singular finite trees 

We start this section with establishing the complexity of F0 2 [J, + ]. 

Theorem 2. The satisfiability problem for FO [4-+] over finite singular trees is 
NExpTiME-comp/eie. 

To show the upper bound we observe that every singularly satisfiablc formula 
has a singular model whose all paths are bounded polynomially. This fact is a 
generalisation of Theorem 2.1.1 from [24], that every F0 2 [<] formula <p, singu- 
larly satisfiable over finite words, has a finite singular model with polynomially 
many elements. Actually, our work is strongly influenced by the construction 
from 24!, and, generally, can be seen as its adaptation to the case of trees. We 



describe here all the required constructions, but omit some proofs, as many of 
them are obtained by obvious adjustments of the corresponding proofs for the 
case of words. Thus, in order to fully understand all the details, we advise the 
reader to familiarise with Chapter 2 of 24 . 

Before going further we discuss the main differences with the case of words. 
The main idea from [23] is to show that for a given singular word 2U |= tp, a 
letter a G To and a given sub formula £(x) of tp there exists a division of 2U 
into polynomially many segments in which, at elements satisfying a, the value 
of £(x) is constant. In our case the role of those segments is played by slices, i.e., 
connected components of trees. We show that each path intersects polynomially 
many slices. In |24j left and right witnesses are considered. In our case they 
correspond to upper and lower witnesses (which, however, in contrast to the 
case of words, are not necessarily linearly ordered), but we must also deal with 
free witnesses. Finally, the small model is constructed by picking at most three 
witnesses for each slice. As the total number of considered slices in a tree may 
be exponential we have to be careful at this point, to avoid choosing too many 
witnesses from a single path. 

Now we turn to technical details. Recall that in the current scenario we 
have four order formulas x], + y, x=y, yi + x and x^y. We also use a shortcut: 
x\-,y — x-\. + y V x=y. The normal form from Lemma [T]is not very useful since it 
introduces fresh unary predicates that destroy singularity of models. Thus, we 
only slightly adjust formulas by converting them to existential negation form 
(ENNF). A formula tp G F0 2 [i + ] is in ENNF if it does not contain any universal 
quantifier, and negations only appear in front of unary predicates or existential 
quantifiers. Negations in front of order formulas are not allowed. Obviously, any 
formula ip £ F0 2 [4-+] is equivalent to a formula in ENNF of size at most 2\ip\. 

We may view our formulas as positive boolean combinations of order formulas 
and formulas with at most one free variable. 

Proposition 3. Let tp G F0 2 [4-+] be a formula in ENNF. Then there exists 
a number s G N, a positive boolean formula j3 in variables Zu, Z-, Z^+, Z^, 
Xi, . . . , X s , and formulas <pi,...,ip s G FO [|. + ] in ENNF, each with at most one 
free variable, such that p — f3(x\. + y, x=y, y\.+x, x^y, tpi,..., tp s ). Moreover p = 
(xl + y Aip) x i +y )\/ (x=y Aip^ x=y )\/ (yl + x /\ip) yi+x )\/ (x^y /\ip) x ^ y ) where p\ x i +y = 
/3(T, _L, _L, _L, ipi, . . . , p s ), and tp\g is analogously defined for the remaining 9-s. 

For a finite tree T and a set of nodes PCTwe define max(P) as the set of 
the maximal nodes from P and min(P) as the set of the minimal nodes from 
P, with respect to the order relation 4-+- For example max(T) is the set of the 
leaves and min{T) is the singleton consisting of the root of T. 

Lemma 6. Let C,\{y), ..., C,t{y) be FO [4-+] formulas with y as the only free vari- 
able and in ENNF, and let T be a finite singular tree. Let /3 be a positive boolean 
formula in the variables Z^ + , Z-, Z^+, Z^, Y\, . . . , Y t , let ip(x,y) = f3(x] r+ y,x=y, 
yi + x,x^y, Ci(y),.-.,Ct(2/))> and let tp(x) = 3ytfj(x,y). Let P' := {u G T\1 \= 
ip\xl+y[u,v], for some v s.t. i4+u} 7 Q' := {u G T\T \= ip M+x [u,v], for some 
v s.t. V-\. + u}, R' := {u G T\T \= ipij X7 c y [u,v], for some v s.t. w/^v}. Set P = 



max(P') and Q — min(Q' U R'). Then for all u £ X, X (= ^>[w] i/f t/iere exists 
peP s.i. u4_»p or i/iere exists q £ Q s.t. q\.*u or X \= ip\ x =y[Uj u]. 

Remark. Notice that on every path in X there is at most one point from P and 
at most one point from Q. 

Let a € To be a letter, X a finite singular tree, and S a set of nodes of X. Then 
by S a we denote the set of nodes in S where the letter a occurs. We also say 
that S is a tree slice iff it induces a connected (with respect to the symmetric 
closure of the child relation 4.) subgraph of X. 

Lemma 7. Let ip £ FO [4-+] be a formula in ENNF and with one free variable, 
let % be a finite singular tree, and let a £ tq. There is a set SCT which is a 
union of tree slices in X such that: for every u £ T a we have X |= <p[u] iff u £ S; 
and every path in X intersects at most \<p\ 2 tree slices from S. 

The proof is inductive: if p = 3yip(x,y), for ip(x,y) = f3{x] r+ y,x=y,y] r+ x, 
x^y, £i(x), . . . , £ s (x), Ci(y); • • • > Ct{y)), then we consider the slices obtained in- 
ductively for the formulas £a(x). The slices for different er-s may overlap. Their 
endpoints determine a more refined division into slices, such that in each slice, 
on nodes carrying a, the values of all £<r(x) are constant. In each such slice we 
apply Lemma [6] to introduce new divisions. Now arguments and calculations 
similar as in the proof of the corresponding Lemma 2.1.10 from [24] lead to the 
desired claim. 

Lemma 8. Let p be an FO [\. + ] formula over a signature t. If p is satisfied in 
a singular tree, then p is also satisfied in a singular tree, in which the length of 
every path is bounded by 6 • \t\ ■ \p>\ 3 ■ 

Proof. We assume that ip is in ENNF. Let X |= <p be singular, and let p\, . . . , (pk 
be the subformulas of p of the form 3xip for some variable x and some formula 
ip. For every k £ [1, . . . , k] we use Proposition [3] to find a positive boolean for- 
mula such that ip K (x,y) = P(xl+y,x=y,yl + x,x>/'y ) £i(x),...,€ s (x),Ci(y), ■■■, 
Ct(y))- For every a £ r and every a £ [1, . . . , s] let S„ be a set as in Lemma [7] 
applied to the formula £ CT (x) and a, where every path intersects at most |£ CT | 2 
tree slices from S%. Thus there is a set 3° of tree slices / such that: every path 
in X intersects at most 2 • X^o-eri si l^l 2 °f them; {J Ie:}a 1 — 1; and there are 
£1, . . . ,£f £ {T, _L} such that X |= ^[u] iff ££ = T for every u £ I satisfying 
a[u]. For each I e 3° we consider the formula p ! K = 3yip T K (x, y), where ipi(x, y) = 

P(xUy,x=y,yUx,x^y^(,...,^,a(y),---,Q(y))- Let P" ■- {v e X | x f= 

^^4. + «/[ u ' u ] for a11 *4+ u }> Q" := {« G 2 I X h ^ ra4 . +2: [w,w] for all f|+u}, 
i?" := j> e X I X (= ^rx/a^j^] for a11 U ?M, and let p ' = max(P"),Qi = 
min(Q"),Ri = mai(fi"). Let T° - U/ e3s (^ UQ 7 Ui? 7 ), T' = U ree[ i, fc] , aeTo ^, 
and let X' be the restriction of X to T". Lemma |6] can be used to prove that 
X' |= ip. Also it can be shown that the paths of X' are bounded as required. □ 

Corollary 2. Let p be a singularly satisfiable FO [\. + ] formula. Then it is sat- 
isfied in a singular tree whose number of nodes is exponential in \p\. 
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Proof. Let 1 \= p be a singular tree over the signature r, let T be the frame of 
T. By Lemma [5] we may assume that all its paths are bounded polynomially. Let 
ip' be the normal form formula over signature r' from the statement of Lemma 
[T] By that lemma ip' is satisfiable in a model %' based on the frame T ■ By 
Lemma 0] we can remove some subtrees from T' to obtain a model T" |= p' with 
exponentially bounded degree. Again by Lemma [TJ the restriction of T" to the 
original signature r is a singular model. As its paths are bounded polynomially 
and the degree of nodes is bounded exponentially, the total number of nodes is 
bounded exponentially in \>p\ as required. □ 

Corollary [2] justifies the upper bound from Theorem [21 since for a given <p> we 
can nondeterministically guess its exponential model and then verify it. 

The exponential bound on the degree of nodes in singular models of F0 2 [4+] 
formulas is essentially optimal. Indeed, let us see that there exists a formula of 
size polynomial in n in whose every model the root has 2" children. We use unary 
predicates root, elem, bo, ... , b n —i, and say that all elements in elem are children 
of the root: \/x(root{x) ■& -By yi + x) A \/x(elem(x) <^> (^3y(yl + x A ~^root(y))) . 
We think that each v in elem encodes a number < N(v) < 2™ such that the 
i-th bit in its binary representation is 1 iff the formula 5i(x) — 3y(x^ + y A bi(y)) 
is satisfied at v. In a standard way we can now write a formula first(x) which 
says that N(x) — 0, a formula last(x) stating that N(x) = 2" — 1, and a 
formula succ(x, y) saying that N(y) = N(x) + 1. Now the formula 3x first(x) A 
\/x(-ilast(x) => By succ(x, y)) is as required. This idea can be easily employed 
to obtain NExpTime- lower bound in Theorem [2J 

It appears, that the ability of speaking about pairs of nodes in free position 
is crucial for NExpTiME-hardness. Indeed if we allow only guarded formulas, 
we get P Space complexity. The upper bound in the following theorem can be 
proved by bounding polynomially not only the length of the paths but also the 
degree of the nodes in models of GF 2 [J, + ] formulas. 

Theorem 3. The satisfiability problem for GF [J, + ] over finite singular trees is 
P 'Space- complete. 

Finally we show that augmenting GF [! + ] with any of the remaining binary 
navigational predicates leads to ExpSPACE-lower bound over singular trees. 

Theorem 4. The satisfiability problem over singular trees for each of the logics 

GF 2 [! + ,|], GF 2 [! + ,->] 7 GF 2 [| + ,^+] is ExpSPACE-/iard. 

5 Future work 

One possible direction of a further research could be investigating the case in 
which infinite trees are admitted as models. It seems that the complexity results 
we have obtained for finite trees can be transfered to this case without major 
difficulties. It could be interesting to examine also the cases in which T^ n contains 
4- but does not contain \ r+ . A related result is obtained in [B], where NExpTime- 
completeness of FO with counting quantifiers and arbitrary number of binary 
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symbols, of which fixed two have to be interpreted as child relations in two trees. 
The trees considered in [6] are, however, ranked and unordered. 

Acknowledgement. Similar results were obtained independently in [3]- The 
two works were merged into a single paper [2J. 
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A Proof of Lemma [5] 

Proof. Assume that tp is satisfiable. By Lemma [3] and Lemma [4] there exists a 
small model 1 \= tp. The procedure accepts ip by making all its guesses in ac- 
cordance to T, i.e. in the first step it sets a to be equal to the full type of the 
root of T and then in each step it sets on to be the full type of the i-th child of 
the previously considered element. In the opposite direction, from an accepting 
(tree-)run t of the procedure we can naturally construct a tree structure Tt, with 
1-types of elements as guessed during the execution. Our procedure guesses actu- 
ally not only 1-types but full types of elements. The function locally-consistent 
guarantees that the full types of elements in Tt are indeed as guessed. Since the 
procedure checks if each of those full types is (^-consistent, then by Proposition 
[T]wc have that %t \= ip. □ 



B Proof of Lemma 2J] 

Proof. Suppose T \= <p[u], then there is v £ T such that 1 |= i/)[u,v]. If u^v 
then X |= ip^ xi cy[u,v\. By definition u £ R' and thus there is q £ Q such that 
q\, t u. The cases v\, + u and u\. + v are similar. If u = v then T \= tp[u,u] and thus 
2- |= ip\x=y[ u J u ]- m the opposite direction, suppose there is q £ Q such that 
q\.,u. Notice that if q\.,u then for every node v we have q^v =>■ w/'V. So if q £ R' 
then there is a node v such that % |= i\>\ xr p v \u,^v\. Otherwise q £ Q' and there 
exists a node v such that T (= ip\yi+x[u,v]. In both cases there is a node v such 
that % |= t/)[u, v] and thus % |= <p[u]. The case if there is p £ P such that u\.,p is 
similar. If T |= 4>\ x =y[u,u] then T |= 4>[u,u] and thus % \= ip[u]. □ 



C Proof of Lemma [7] 

Lemma [T] Let tp £ FO [4_ + ] over the alphabet To in ENNF and with one free 
variable, let X be a tree over the alphabet tq, and let a £ tq. There is a set S C T 
which is a union of tree slices in T such that for every i £ T a : % \= tp[u] iff 
u £ S; and every path in T intersects at most \tp\ 2 tree slices from S . 
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Proof. Induction on the structure of f. We consider only the case when if = 
3ytp(x,y). Otherwise the proof is similar as in the corresponding Lemma 2.1.10 
from [24 . Let 

ip(x, y) = /3(xl+y, x=y, yl+x, x^y, ^(x), ..., £ s (x), & (y), ■■■, Ct(y)) 

Applying the inductive hypothesis to the formulas £ CT , a £ [l,s], let S a be 
the set as described in the statement of this lemma, and let Ifaki)* • - • > !{<r,k a ) 
be tree slices such that every path in T intersects at most |£ CT | 2 of them and 
S a = Uj=i I((T,l)- We define the set H = U* =1 S a U {r} U L, where r is the root 
of T and L is the set of leaves in X. 

Looking at each tree slice I bounded by points from H, the truth values of 
the formulas £1, . . . , £ s remain constant among all points from I a . Let £*,...,£* 
be these respective true values. Thus, on all nodes from I a , ip(x) is equivalent 
to 3yf3(xl + y, x=y, yi+X, x^y, £f, . . . , £*, Ci(y), • • ■ , Ct (&/))• This formula satisfies 
the requirements of Lemma |51 so that the truth of ip(x) over I a is determined 
by the relative position of x with respect to P, Q and by truth of the formulas 
Ci(x), ■ • ■ , (t(x) for the nodes in between P and Q. We now can construct the 
set S of all nodes from 1 a where <p(x) is true as the union of tree slices bounded 
by: points from H; points that result from applying this lemma to the formulas 
(i(x), . . . , (t(x); or points from P and Q added on every tree slice /. 

We set a path in T and count the number of tree slices from S this path 
intersects. An intersection of a path from 1 with a tree slice is an interval. By 
the remark made after Lemma [H] we know there is at most one point from P and 
Q added on every path in /, thus there is at most one point pGP and q <E Q 
on every interval. This means we can use the calculations in Lemma 2.1.10 from 
[24] to achieve at most |<p| 2 intervals on every path in T. 

a 



D Remaining part of the proof of Lemma [S] 

We argue that T' \= ip. To see this, we show by induction that for every sub- 
formula 77 of ip with at most one free variable and all u £ T", % |= rj[u] iff 

S* MM- 

If T) is an atomic formula or a boolean combination of other formulas then 
the claim is obvious. Suppose rj = tp K for some k G [1, k]. 

Suppose that u £ I and T \= rj[u]. Then there is a v £ 1 such that T |= 
ip K [u,v]. Let Je3" such that u £ I. We find v £ T" such that 1 |= ^ re [u,u] as 
follows: if m^+w then there is a « e P; such that v\,„v, if i4 + u then there is a 
v £ Q/ such that 04-»v, if u^v then there is a -0 £ i?/ such that v],,v, if u = v 
then we set v = u. Clearly T' (= ^k[u, {*] and thus T' (= r/[u]. Suppose now that 
u £ T" and 1' (= 77 [u]. Then it is easy to see that T |= r/[u}. 

So because 1 |= tp, we have 1' |= tp. We now show that paths in T' have 
a bounded length. Set / £ 3° and the formula ^ T K {x,y) as in fragment of this 
proof from the main part of the paper. For every b £ tq and every i £ [l,t] let 
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Sf be a set as in Lemma [7] applied to the formula Q(y), where S^ intersects at 
most \Q\ 2 tree slices on every path. Thus there is a set M%, of tree slices I' such 
that every path in 1 intersects at most 2 ■ J2ie\i t] \d\ 2 °f them and {J& b K = 1. 
Set a path in X and let 3« be the set of intervals that are the intersections of 
this path with tree slices from M.^.. We claim that there is at most one element 
from Pj on every J £ Z b K - Suppose we have u £ I and vi, v% e J fl P\. We show 
that 1 \= ^ xir+y [u,vx] iff T h V^+j/K^]- Indeed recall that ^{ {xi+y {x,y) = 
/3(T, _L, _L, _L, £(, . . . , £%, (i(y), ■ ■ ■ , (t(y))- Thus the boolean value of /3 depends 
on the boolean values of Q(v)- But we assumed that they are the same for V\ 
and V2- Since Pi is a set of maximal nodes then v\ — v 2 . We can do analogous 
calculations for the sets Qi and i?/. Altogether the length of every path in X' is 
at most 3 • 2 • E« e[1 , fc ],a ero ,i e[M iCil 2 < 6 • |r| • |^| 3 . □ 



E Complexity of GF 2 [d, + ] over singular trees 

In this section we expand our arguments for PSpace upper bound for GF 2 [4.+] 
over singular trees. 

A GF [4-+] formula <p is in normal form if ip = /\ ieI \/xy(i]i(x, y) ^> ipi(x, y))/\ 
/\ ieJ \/x(Xi(x) =^ 3y(r)i(x,y) A i/ji(x,y))), for some disjoint index sets I and J, 
where rji is a guard of the form x], + y, yi + x or X=y, Aj(x) is an atomic formula 
a(x) for some unary symbol a, and tpi(x,y) is a boolean combination of unary 
atomic formulas. 

We can prove a slightly weaker counterpart of Lemma [1] for GF 2 . Namely, 
we show that satisfiability of a GF 2 [|. + ] formula can be reduced to satisfiability 
of a normal form GF 2 [|. + ] formula nondeterministically. 

Lemma 9. There exists a nondeterministic procedure GF 2 [4- + ]-normalisation, 
such that for a GF [\, + ] formula ip over a signature t, and a tree frame T con- 
sisting of at least two nodes the following holds. The formula ip is satisfiable over 
T (singularly satisfiable over T) if and only if there exists a polynomial execution 
of GF 2 [|. + ]-normalisation on ip producing a normal form GF [4- + ] formula <p' 
over a signature r' consisting of r and some additional unary symbols, satisfiable 
over T (satisfiable over T in a model which restricted to r is singular). 

Proof. By the work from [53] it follows that for a given GF 2 formula ip over a sig- 
nature t there exists a polynomially computable formula ip' = /\ iG j{{^x Ti(x)) ■&■ 
3x(Xi(x) A ipi(x)) A ((Vas n{x)) V (Vas -r< (»)))) A /\ ieJ Bx^x) A ^(x)) A ip", for 
some disjoint index sets / and J, over a signature consisting of r and some ad- 
ditional unary predicates, where Aj(x) is an atomic formula a(x) for some unary 
symbol a, ipi{x) is a boolean combinations of atoms, ip" is in normal form, and 
none of r^-s is used as a guard, such that <p and ip' are satisfiable over the same 
tree frames. Now for each i £ / we guess whether Va; rj(x) is satisfied or not 
and replace the occurrences of Ti{x) and Ti{y) in ip' by T or 1 appropriately. 
We thus get a conjunction of a normal form formula, some formulas of the form 
3x(\i(x) Aipi(x)), and some formulas of the form Sx(Xi(x) Aipi(x)). A formula 
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of the last type can be rewritten as \/xy(x = y =4» -i\i(x) V -iipi( x ))- To deal 
with purely existential statements we introduce a fresh unary predicate root and 
make it true precisely at the root of a tree by adding the conjunct \/xy(x=y =>■ 
(root(x) <^> -i3y(yj, + a;))). A formula 3x(\i{x)Aipi(x)) can be now rewritten as the 
normal form conjunct \/x(root(x) =4* 3y(xl + y A(Xi(y) Aipi(y)) \/(Xi(x) Aipi(x)))). 
This transformation works properly over trees containing at least two nodes. The 
describe nondeterministic procedure is thus the required GF 2 [4_ + ] -normal is at ion 
procedure. □ 

Let us see that in an arbitrary (not necessarily singular) model 1 of a normal 
form GF [\. + ] formula p> we can find a submodel in which the degree of nodes 
is bounded polynomially in \ip\ and in the length of the paths of T. As we are 
able to shorten paths in singular models to length polynomial in \(p\, this will 
lead to a polynomial bound on the degree of nodes in singular models of GF 2 [I + ] 
formulas (which, as we have seen, contrasts with the case of F0 2 [4-+]). 

Lemma 10. Let tp be a normal form GF [\, + ] formula and let 1 \= tp. Then 
there exists a submodel %' \= ip of % in which the number of successors of each 
node is bounded by max ■ \tp\, where max is the length of the longest path in T. 

Proof. Let v be the root of T. For every conjunct tpi of tp of the form Vx(Ai (a;) =>■ 
3y(r/i(x,y) Aipi(x,y))), with rji(x,y) — x^+y pick a witness w for v and tpi, mark 
w and mark all the elements u such that 1 (= u\.+w, i.e., the elements on the 
path from the root to w. Remove all subtrees rooted at successors of v containing 
no marked elements. Repeat this process for all the elements v of T, say, in the 
depth-first manner. Note that the structure obtained after each step is a model 
of tp, since we explicitly take care of providing lower witnesses, and the upper 
witnesses are retained automatically as every element which is not removed 
from the model is kept together with the whole path from the root from the 
original model T. Let 1' be the structure obtained after the final step of the 
above procedure. Observe that the number of marked descendants of an element 
located at level I is bounded by (/ + 1) • \ip\, thus the degree of each node of %' 
is bounded by max -\ip\ as required. □ 

We recall the statement of Theorem [3] from the main part of the paper, and 
prove its part related to the upper bound. Lower bound is proved in the next 
section. 

Theorem [3J The satisfiability problem for GF [\. + ] over finite singular trees is 
PSPACE-complete. 

Proof. We show here that the problem belongs to PSpace. We check satisfia- 
bility of a given GF [i. + ] formula (p over signature r over singular trees in the 
following way. We run the non-deterministic procedure GF 2 LL + ] -normalisation 
(see Lemma [9]) and obtain a formula ip' over signature r' . On this formula we 
run a natural adaptation of the procedure FO 2 LI, ! + , —t, — K 1 "] -sat -test in which 
maxdepth is set to 6- \t\ ■ \<p\ 3 , maxdegree is set to maxdepth- \ip'\, the conditions 
verified in function locally-consistent are appropriately simplified (only a(0n + ), 
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and a(#-|-|-+) are relevant this time), and which checks if all 1-type contain pre- 
cisely one positive unary atom from r. 

The described alternating procedure works in time bounded polynomially in 
<p, so, as APTime=PSpace [5], it can be also implemented to work in deter- 
ministic polynomial space. We claim that it accepts ip iff ip has a finite singular 
tree model. Assume that ip is accepted. This means that ip' has a tree model 
which restricted to r is singular. By Lemma EH it follows that (p has a singular 
model. In the opposite direction, let 1 \= ip be singular, and let T be the frame 
of 1. By Lemma|8]we can assume that the depth of T is bounded by 6 • |r| • \ip\ 3 . 
By Lemma [SJ GF 2 [,J. + ] -normalisation can produce ip' which is satisfiable over 
T, say in a model X'. By Lemma 21 ip' is also satisfied in a submodel X" of T' 
in which the degree of every node is bounded by 6 • \t\ ■ \(p\ 3 ■ \ip'\. Thus our 
alternating procedure can make all its guesses in accordance to 1" and accept. 
□ 

F Lower bounds for logics over singular trees 

Theorem 5. The satisfiability problem for FO [4+] over singular finite trees is 
NExpTlME-/iard. 

Proof. We give a reduction from the satisfiability problem of unary FO , which is 
known to be NExpTiME-complete (see e.g., [7]). For a given FO 2 formula ip over 
a unary signature r we construct an equisatisfiable FO [4-+] formula T(<p) over 
the signature rU{4. + , elem} where elem is a fresh unary predicate. Without loss 
of generality we may assume that ip is built from variables x, y, unary predicate 
symbols, boolean connectives A, -i and existential quantification. 
Now we inductively define the translation T{ip). 

T(p(x)) = 3y xi + y Ap(y) 
T{-*p) = -,T(ip) 

T(i Pl Aip 2 )^T(ip)AT{ip 2 ) 
T(3x ip) — 3x elem(x) A T(ip) 

Note that T(ip) is a formula of length linear in (\ip\). It remains to be shown that 
if and T(ip) are equisatisfiable. 

For one direction, assume that 21 is a model of (p. Construct a tree T such 
that all elements of the universe of 21 are immediate successors of the root of 
% and are labeled elem; each such element e has as many immediate successors 
as there are predicates in r that are true of e, and each such successor is a leaf 
labeled with a distinct predicate true of e in 21, see Figure Q] It can be easily 
proved by induction on the structure of <p that T is a (singular) model of T(ip). 

For the other direction assume that X is a model of T(<p). Construct a struc- 
ture 21 such that the universe of 21 is the set of nodes labeled elem in T and 
for all elements e and all predicates p, p(e) is true in 21 if and only if there is 
a node e' labeled p that is below e in 1. Again it is easy to prove by structural 
induction that ip is true in 21. □ 
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Fig. 1. Representation of a structure over the signature {p, q}. There are two elements 
in the universe; the first belongs to the relations p and q, the second to p. 



Theorem 6. The satisfiability problem for GF [4-+, 4-] over singular finite trees 
is ExpSPACE-/iard. 

Proof. We give a reduction from GF 2 [4- + ] over arbitrary trees. The idea of the 
encoding is the same as in Theorem[5j a node e in a tree is modeled by a singular 
node labeled elem with immediate successors encoding predicates true in e. The 
binary predicate 4-+ is used to preserve the structure of the tree, the additional 4- 
predicate gives the access to nodes modeling unary predicates. In the following 
reduction, for a given GF 2 [4. + ] formula <p over a signature r = t U {4-+} we 
construct a GF [4-+, 4-] formula over the signature r U {4-, elem} that is satisfiable 
over singular trees if and only if <p is satisfiable over trees. 

Let us start with a formula ensuring that the underlying structure is an 
encoding of a tree. The formula tree is defined as the conjunction of 

A Mx p(x) => Vy y\.+x =>• elem(y) 

p£TQ\j{elem} 

with 

Mx elem(x) =>• Vy x4-+2/ =^ \J p(y)- 

p£TQ\j{elem} 

It ensures that (unless the tree is trivial, i.e., no node is labeled at all) each node 
is labeled with some predicate symbol, all internal nodes are labeled elem and 
only leaves may be labeled with predicates from To. 

Without loss of generality we may assume that the formula ip is built from 
unary atoms, boolean connectives A, -i and guarded existential quantification. 
The translation T((p) of a formula ip is defined inductively as follows. 

T(p(x)) = 3y xiy A p(y) 
T(^) = -,T(<p) 
% 1 A^ 2 )=T(^)A% 2 ) 
T{3x p(x) A ip{x)) = 3x elem(x) A T(p(x)) A T(ip(xj) 
T(3y xi + y A ip(x, y)) = 3y xl + y A elem(y) A T(ip(x, y)) 
T(3y yl + x A ip(x, y)) = 3y yi + x A elem(y) A T(ip(x, y)) 
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Note that T(ip) is a guarded formula of length linear in (|<^|). Again a simple 
inductive argument shows that ip is satisfiable if and only if tree A T(<p) has 
a singular tree model. □ 

Theorem 7. The satisfiability problems for GF 2 [4. + , — !>] and GF 2 [4. + , — s>+] over 
singular trees are ExpSPACE-hard. 

Proof. We follow the construction from |13| and give a generic reduction from 
AExpTime. Consider an alternating Turing machine M working in exponential 
time. Without loss of generality we may assume that M works in time 2™ and that 
every non-final configuration of M has exactly two successor configurations. Let 
w be an input word of size n. Following [13] we construct a formula whose models 
encode accepting configuration trees of machine M on input w. In [13] each 
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Fig. 2. Left: frame of a configuration tree in |T3]; nodes m and ri2 are siblings. Right: 
frame of a configuration tree in our encoding; nodes m and ri2 are not siblings. 



configuration is represented by 2™ elements of a tree, each of which represents 
a single cell of the tape of M (see left part of Figure [2]) . Each such node is then 
labeled with unary predicate symbols from the set {Ci, . . . , C n , P\, . . . , P n } to 
encode the number of a configuration (i.e., the depth of the configuration in the 
computation tree) and its position (i.e., the number of a cell) in the configuration: 
Ci (x) is true if the «-th bit of the configuration number is 1 and Pi (x) is true if 
the i-th bit of the position number is 1. Additional predicate symbols are used 
to encode the tape symbol and the state of the machine (if it is necessary, i.e., 
if the head of of the machine is scanning the cell under consideration). Here, 
to encode the numbers, we use additional In elements that are siblings of the 
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node representing a cell, see right part of Figure [3J Each of these elements stores 
information about a single bit using one of two unary predicates zero or one. 
Then the atomic formulas Ci{x) and Pi(x) are simulated by formulas 

By x^ + y A Pathi(y) A one(y) and respectively By x— > + y A Path n+ i(y) Aone(y) 

where the subformula Pathi(y) is defined recursively as follows. For the logic 
GF 2 [| + ,^] we define 

Patho(y) = —E\x x—¥y 
Pathi+x{y) = 3x x-^y A Pathi(x) 

and for the logic GF 2 [| + , — !>+] we define 

Path>o(y) — -3x x— > + y 
Path>i + i(y) = 3x x— > + y A Path>i(x) 

Pathi(y) = Path>i(y) A ->Path>i+i(y). 

Note that in both cases the formula Pathi is guarded and has polynomial length. 
The negated atomic formulas -iCj(x) and -iPi(x) are simulated using predicate 
zero instead of one. 

Now, having the ability to count, we may encode tape symbols and states of 
the machine by simply using more siblings, and we may follow the lines of the 
construction in [T3] to encode the computation of M. The only remaining subtle 
point is that in |13j the two successor configurations are siblings in a computation 
tree while here they must not be siblings in order not to mess up the information 
about numbers — this may be simply done by rooting the two configurations at 
different nodes as shown on Figure □ 



Theorem 8. The satisfiability problem for GF [J. + ] over singular trees is PSpace- 
hard. 

Proof. We propose a reduction from the satisfiability of quantified boolean for- 
mulas, QBF. Let ip be an instance of QBF problem. Without loss of generality 
we may assume that ip is of the form 

3vk ■ ■ ■ 3v 2 Vviip' 

where the number of all quantifiers (fc) is even, all even-numbered variables are 
existentially quantified, all odd-numbered variables are universally quantified 
and ip 1 is a propositional formula over the variables V\ , . . . , u&. 

We now translate the formula tp to a formula over the signature 

r = {root, leaf, true, false, 4-+} 

such that ip is true if and only if its translation is satisfiable over singular trees. 
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First, for i G {0,...,fc} we define auxiliary formulas depth i and height}. 
Let depth (x) = root(x) and for i > 1 let depth^x) = 3y xj, + j/ A depth i _ 1 {y). 
Intuitively, the formula depth^x) expresses that the node x occurs at distance 
at least i from the root. Let height (x) = leaf(x), height x (x) — depth k (x) and 
let height^x) — depthk+i-i(x) A^depth k+2 _ i (x) for i > 1. For z > the formula 
height ^(x) expresses that x is a node at depth exactly fc+1 — i\ in the construction 
below, for i > 0, the formula height ,(x) will mean that the subtree rooted at x 
has height z. Note that height t (x) is a guarded formula of length linear in z. 

In the following construction a model of the translation of "0 is a tree that 
describes a set of valuations justifying that ip is true. It is a binary tree of depth 
fc + 1 where every path describes a valuation of variables vi,...,Vk- Every node 
at height i is labeled either true or false, which corresponds to a value of the 
variable Vi under a given valuation. Every non-leaf node at odd height i has two 
successors corresponding to the universally quantified variable i>j+i; every node 
at even height i where i > has one successor corresponding to the existentially 
quantified variable i>i+i. If k > then let treek be the conjunction of 

3x root(x), (1) 

Vx root(x) =4- (3y x4_ + y A height k (y) A (true(y) V false (y)), (2) 

Vx true(x) => ( height t (x) =4> 

((3y x| + y A height ^(y) A true{y)) (3) 

A (3y x| + y A height ^(y) A false (y))) ) 

for all even numbers 2 < z < fc, 
Vx false (x) =>■ ( height t (x) =>■ 

((3y x| + y A height ^(y) A true{y)) (4) 

A (3y x| + y A height .^(y) A false (y))) ) 

for all even numbers 2 < z < fc, 
Vx true(x) =>■ ( height t (x) =>■ 

3y x| + y A height ^(y) A (true(y) V false (y))) (5) 

for all odd numbers 3 < z < fc, 
Vx false (x) =>■ ( height t (x) =>■ 

3y x| + y A height ^(y) A (true(y) V false (y))) (6) 

for all odd numbers 3 < z < fc, 
Vx true(x) =*> ( height 1 (x) => 

(By x| + y A tea/ (?/))), (7) 

Vx false (x) => ( height 1 (x) =>■ 

(3yx| + yAW(y)))- (8) 

In the case of A; = the formula ireeo boils down to 3x root(x) A Vx root{x) => 
(3y x-l+y A leaf{y)). Note that iree^ is a guarded formula of length polyno- 
mial in fc. Now we inductively define the translation T{ip') of the quantifier-free 
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formula ip 1 . 

T(true) = true 
T (false) = false 

T(vi) — 3y yi + x A height t (y) A true(y) 

%!V^)=%)VTfe) 

Note that T(ip') is a guarded formula of length polynomial in (\ijj'\ + k). It is 
not difficult to prove by induction on k (and by nested structural induction on 
propositional formulas with free variables v±, . . . , Vk) that ip is true if and only if 
treek A Vx leaf(x) =>■ T(-)p') has a singular tree model. Each node labeled leaf in 
such a model uniquely determines a path to a node labeled root and such a path 
corresponds to a valuation of the variables v\, . . . ,Vk that makes the formula ip' 
true. □ 
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